OCR exploration server

A Fourier descriptor based character recognition engine implemented under the Gamera open-source document processing framework

Internal identifier: 001423 (Main/Exploration); previous: 001422; next: 001424

Authors: Jared Hopkins [United States]; Tim Andersen [United States]

Source: SPIE proceedings series (ISSN 1017-2653), 2005

RBID : Pascal:05-0361255

French descriptors

English descriptors

Abstract

This paper discusses the implementation of an engine for optical character recognition (OCR) of bi-tonal images using Gamera, an existing open-source framework for building document analysis applications. The OCR engine distinguishes characters using features based on the Fourier descriptor, and is designed to handle character images that contain multiple boundaries. The algorithm assigns each character image a signature that encodes the boundary types present in the image and the positional relationships between them. Under this approach, only images with the same signature are comparable. In effect, a meta-classifier first computes the signature of an input image and then dispatches the image to an underlying neural-network-based classifier trained to distinguish between images having that signature. The engine's performance is evaluated on a set of sample images taken from the newspaper domain and compares well with that of other OCR engines. The source code for the engine and all supporting modules is currently available upon request and will eventually be made available through an open-source project on the SourceForge website.
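
As a rough illustration of the two ideas in the abstract, the Python sketch below computes a simple Fourier descriptor for a closed boundary and routes a character to a per-signature classifier. It is not the authors' implementation and does not use the Gamera API. The function names, the reduced signature (only the counts of outer and inner boundaries), the counter-clockwise ordering of boundary points, and the stand-in classifiers are all assumptions made for this example. The abstract describes a richer signature, which also encodes boundary types and positional relationships, and neural-network classifiers.

```python
import cmath
import math
from typing import Callable, Dict, List, Sequence, Tuple

Point = Tuple[float, float]
Boundary = Sequence[Point]
Signature = Tuple[int, int]  # hypothetical: (outer boundary count, inner boundary count)


def fourier_descriptor(boundary: Boundary, n_coeffs: int = 4) -> List[float]:
    """Normalised magnitudes of low-order DFT coefficients of a closed boundary.

    Assumes the points are listed counter-clockwise and that the boundary
    has more than n_coeffs + 1 points. Each point becomes z = x + iy.
    Skipping coefficient 0 removes translation, taking magnitudes removes
    rotation and start-point dependence, dividing by |c_1| removes scale.
    """
    n = len(boundary)
    z = [complex(x, y) for x, y in boundary]
    # DFT coefficients c_1 .. c_(n_coeffs + 1)
    coeffs = [
        sum(z[k] * cmath.exp(-2j * math.pi * u * k / n) for k in range(n)) / n
        for u in range(1, n_coeffs + 2)
    ]
    scale = abs(coeffs[0])
    if scale == 0:
        raise ValueError("degenerate boundary: first harmonic is zero")
    return [abs(c) / scale for c in coeffs[1:]]


def signature(outer: Sequence[Boundary], inner: Sequence[Boundary]) -> Signature:
    """Simplified stand-in signature: how many boundaries of each kind exist."""
    return (len(outer), len(inner))


def classify(
    outer: Sequence[Boundary],
    inner: Sequence[Boundary],
    classifiers: Dict[Signature, Callable[[List[float]], str]],
) -> str:
    """Meta-classifier: compute the signature, then dispatch to its classifier."""
    features: List[float] = []
    for boundary in list(outer) + list(inner):
        features.extend(fourier_descriptor(boundary))
    # Only images with the same signature are compared with one another.
    return classifiers[signature(outer, inner)](features)


if __name__ == "__main__":
    square = [(0, 0), (1, 0), (2, 0), (2, 1), (2, 2), (1, 2), (0, 2), (0, 1)]
    hole = [(0.5 + x / 2, 0.5 + y / 2) for x, y in square]
    # Stand-in classifiers; the paper uses one trained neural network per signature.
    demo_classifiers = {(1, 0): lambda f: "l", (1, 1): lambda f: "o"}
    print(fourier_descriptor(square))
    print(classify([square], [hole], demo_classifiers))
```

Keying a dictionary on the signature mirrors the dispatch step: each underlying classifier only ever sees feature vectors of one fixed length and layout, because every image routed to it has the same number of boundaries.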


Affiliations: Computer Science Department, Boise State University, Boise, Idaho 83725, USA (both authors)



The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">A fourier descriptor based character recognition engine implemented under the gamera open-source document processing framework</title>
<author>
<name sortKey="Hopkins, Jared" sort="Hopkins, Jared" uniqKey="Hopkins J" first="Jared" last="Hopkins">Jared Hopkins</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Computer Science Department, Boise State University</s1>
<s2>Boise, Idaho 83725</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<wicri:noRegion>Boise, Idaho 83725</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Andersen, Tim" sort="Andersen, Tim" uniqKey="Andersen T" first="Tim" last="Andersen">Tim Andersen</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Computer Science Department, Boise State University</s1>
<s2>Boise, Idaho 83725</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<wicri:noRegion>Boise, Idaho 83725</wicri:noRegion>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">05-0361255</idno>
<date when="2005">2005</date>
<idno type="stanalyst">PASCAL 05-0361255 INIST</idno>
<idno type="RBID">Pascal:05-0361255</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000457</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000331</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000441</idno>
<idno type="wicri:doubleKey">1017-2653:2005:Hopkins J:a:fourier:descriptor</idno>
<idno type="wicri:Area/Main/Merge">001471</idno>
<idno type="wicri:Area/Main/Curation">001423</idno>
<idno type="wicri:Area/Main/Exploration">001423</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">A fourier descriptor based character recognition engine implemented under the gamera open-source document processing framework</title>
<author>
<name sortKey="Hopkins, Jared" sort="Hopkins, Jared" uniqKey="Hopkins J" first="Jared" last="Hopkins">Jared Hopkins</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Computer Science Department, Boise State University</s1>
<s2>Boise, Idaho 83725</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<wicri:noRegion>Boise, Idaho 83725</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Andersen, Tim" sort="Andersen, Tim" uniqKey="Andersen T" first="Tim" last="Andersen">Tim Andersen</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Computer Science Department, Boise State University</s1>
<s2>Boise, Idaho 83725</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<wicri:noRegion>Boise, Idaho 83725</wicri:noRegion>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">SPIE proceedings series</title>
<idno type="ISSN">1017-2653</idno>
<imprint>
<date when="2005">2005</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">SPIE proceedings series</title>
<idno type="ISSN">1017-2653</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Algorithm</term>
<term>Automatic classification</term>
<term>Character recognition</term>
<term>Document analysis</term>
<term>Document processing</term>
<term>Feature extraction</term>
<term>Implementation</term>
<term>Multiple image</term>
<term>Neural network</term>
<term>Optical character recognition</term>
<term>Pattern recognition</term>
<term>Performance evaluation</term>
<term>Signal classification</term>
<term>Signal processing</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Reconnaissance caractère</term>
<term>Implémentation</term>
<term>Traitement document</term>
<term>Reconnaissance optique caractère</term>
<term>Analyse documentaire</term>
<term>Image multiple</term>
<term>Algorithme</term>
<term>Classification automatique</term>
<term>Réseau neuronal</term>
<term>Evaluation performance</term>
<term>Reconnaissance forme</term>
<term>Classification signal</term>
<term>Extraction caractéristique</term>
<term>Traitement signal</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">This paper discusses the implementation of an engine for performing optical character recognition of bi-tonal images using the Gamera framework, an existing open-source framework for building document analysis applications. The OCR engine uses features that are based on the Fourier descriptor to distinguish characters, and is designed to be able to handle character images that contain multiple boundaries. The algorithm works by assigning to each character image a signature that encodes the boundary types that are present in the image as well as the positional relationships that exist between them. Under this approach, only images having the same signature are comparable. Effectively, a meta-classifier is used which first computes the signature of an input image and then dispatches the image to an underlying neural network based classifier which is trained to distinguish between images having that signature. The performance of the OCR engine is evaluated on a set of sample images taken from the newspaper domain, and compares well with other OCR engines. The source code for this engine and all supporting modules is currently available upon request, and will eventually be made available through an open-source project on the sourceforge website.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>États-Unis</li>
</country>
</list>
<tree>
<country name="États-Unis">
<noRegion>
<name sortKey="Hopkins, Jared" sort="Hopkins, Jared" uniqKey="Hopkins J" first="Jared" last="Hopkins">Jared Hopkins</name>
</noRegion>
<name sortKey="Andersen, Tim" sort="Andersen, Tim" uniqKey="Andersen T" first="Tim" last="Andersen">Tim Andersen</name>
</country>
</tree>
</affiliations>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001423 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001423 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Pascal:05-0361255
   |texte=   A fourier descriptor based character recognition engine implemented under the gamera open-source document processing framework
}}

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024